Automatic pronunciation evaluation and classification
نویسندگان
چکیده
Pronunciation evaluation is an important module of every spoken language evaluation system. Automatic evaluation of quality of pronunciation that can mimic the performance of human assessors is a difficult task as human assessment accounts for several nuances of pronunciation including vowel substitutions and quality of consonants. This paper presents a novel approach that combines the knowledge of human assessment and the knowledge of the behaviour of automatic speech recognition systems to develop features for pronunciation evaluation. Instead of presenting the correlation of the proposed features with human assessment, the paper presents sentence-level classification accuracies which can directly be used in real-life applications. Inter-human and intra-human agreements, which are indicative of human subjectivity, are also presented. The trends in confusions among humans scores and automatic scores are compared as the number of classification classes is varied.
منابع مشابه
Phonetic Distance Based Accent Classifier to Identify Pronunciation Variants and Oov Words
The state-of-the-art Automatic Speech Recognition (ASR) systems lack the ability to identify spoken words if they have non-standard pronunciations. In this paper, we present a new classification algorithm to identify pronunciation variants. It uses Dynamic Phone Warping (DPW) technique to compute the pronunciation-by-pronunciation phonetic distance and a threshold critical distance criterion fo...
متن کاملPronunciation Verification for Automatic Litera
Arguably the most important part of automatically assessing a new reader’s literacy is in verifying his pronunciation of read-aloud target words. But the pronunciation evaluation task is especially difficult in children, non-native speakers, and pre-literates. Traditional likelihood ratio thresholding methods do not generalize easily, and even expert human evaluators do not always agree on what...
متن کاملAutomatic evaluation of quantity contrast in non-native Norwegian speech
Computer assisted language learning (CAPT) has been shown to be effective for learning non-natives pronunciation details of a new language. No automatic pronunciation evaluation system exists for non-native Norwegian. We present initial experiments on the Norwegian quantity contrast between short and long vowels. A database of native and non-native speakers was recorded for training and test re...
متن کاملPronunciation verification of children²s speech for automatic literacy assessment
Arguably the most important part of automatically assessing a new reader’s literacy is in verifying his pronunciation of read-aloud target words. But the pronunciation evaluation task is especially difficult in children, non-native speakers, and pre-literates. Traditional likelihood ratio thresholding methods do not generalize easily, and even expert human evaluators do not always agree on what...
متن کاملAuditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations
This paper expands our previous work on automatic pronunciation error detection that exploits knowledge from psychoacoustic auditory models. The new system has two additional important features, i.e., auditory and acoustic processing of the temporal cues of the speech signal, and classification feedback from a trained linear dynamic model. We also perform a pronunciation analysis by considering...
متن کامل